Dataset statistics
| Number of variables | 24 |
|---|---|
| Number of observations | 4456615 |
| Missing cells | 2732652 |
| Missing cells (%) | 2.6% |
| Duplicate rows | 3299 |
| Duplicate rows (%) | 0.1% |
| Total size in memory | 850.0 MiB |
| Average record size in memory | 200.0 B |
Variable types
| Numeric | 13 |
|---|---|
| Categorical | 7 |
| Text | 4 |
activity_year has constant value "" | Constant |
| Dataset has 3299 (0.1%) duplicate rows | Duplicates |
debt_to_income_ratio is highly overall correlated with loan_outcome and 1 other fields | High correlation |
income is highly overall correlated with loan_amount and 1 other fields | High correlation |
lender_size is highly overall correlated with lender_type and 1 other fields | High correlation |
lender_type is highly overall correlated with lender_size | High correlation |
loan_amount is highly overall correlated with income and 1 other fields | High correlation |
loan_outcome is highly overall correlated with debt_to_income_ratio | High correlation |
mortgage_term is highly overall correlated with debt_to_income_ratio and 1 other fields | High correlation |
property_value_ratio is highly overall correlated with income and 1 other fields | High correlation |
mortgage_term is highly imbalanced (70.4%) | Imbalance |
property_value_ratio has 977860 (21.9%) missing values | Missing |
combined_loan_to_value_ratio has 793452 (17.8%) missing values | Missing |
white_population_pct has 167194 (3.8%) missing values | Missing |
metro_name has 295289 (6.6%) missing values | Missing |
state_code has 165896 (3.7%) missing values | Missing |
county_code has 165896 (3.7%) missing values | Missing |
census_tract has 167065 (3.7%) missing values | Missing |
income is highly skewed (γ1 = 524.6253903) | Skewed |
loan_amount is highly skewed (γ1 = 96.84012168) | Skewed |
property_value_ratio is highly skewed (γ1 = 1624.467788) | Skewed |
metro_size_percentile has 336394 (7.5%) zeros | Zeros |
Reproduction
| Analysis started | 2024-03-18 16:22:01.508046 |
|---|---|
| Analysis finished | 2024-03-18 16:30:40.234958 |
| Duration | 8 minutes and 38.73 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
race
Real number (ℝ)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.9787626 |
| Minimum | 1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 5 |
| median | 5 |
| Q3 | 5 |
| 95-th percentile | 7 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.2518588 |
|---|---|
| Coefficient of variation (CV) | 0.25143975 |
| Kurtosis | 0.96360276 |
| Mean | 4.9787626 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.81787609 |
| Sum | 22188428 |
| Variance | 1.5671505 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 2726798 | |
| 6 | 558572 | 12.5% |
| 7 | 506449 | 11.4% |
| 3 | 335051 | 7.5% |
| 2 | 299736 | 6.7% |
| 1 | 22266 | 0.5% |
| 4 | 7743 | 0.2% |
| Value | Count | Frequency (%) |
| 1 | 22266 | 0.5% |
| 2 | 299736 | 6.7% |
| 3 | 335051 | 7.5% |
| 4 | 7743 | 0.2% |
| 5 | 2726798 | |
| 6 | 558572 | 12.5% |
| 7 | 506449 | 11.4% |
| Value | Count | Frequency (%) |
| 7 | 506449 | 11.4% |
| 6 | 558572 | 12.5% |
| 5 | 2726798 | |
| 4 | 7743 | 0.2% |
| 3 | 335051 | 7.5% |
| 2 | 299736 | 6.7% |
| 1 | 22266 | 0.5% |
sex
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.0 MiB |
| 1 | |
|---|---|
| 2 | |
| 3 | |
| 6 | 2040 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4456615 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 2660576 | |
| 2 | 1517189 | |
| 3 | 276810 | 6.2% |
| 6 | 2040 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 2660576 | |
| 2 | 1517189 | |
| 3 | 276810 | 6.2% |
| 6 | 2040 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2660576 | |
| 2 | 1517189 | |
| 3 | 276810 | 6.2% |
| 6 | 2040 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4456615 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2660576 | |
| 2 | 1517189 | |
| 3 | 276810 | 6.2% |
| 6 | 2040 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4456615 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2660576 | |
| 2 | 1517189 | |
| 3 | 276810 | 6.2% |
| 6 | 2040 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4456615 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2660576 | |
| 2 | 1517189 | |
| 3 | 276810 | 6.2% |
| 6 | 2040 | < 0.1% |
co_applicant
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.0 MiB |
| 2 | |
|---|---|
| 1 | |
| 3 | 10667 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4456615 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 2502902 | |
| 1 | 1943046 | |
| 3 | 10667 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 2502902 | |
| 1 | 1943046 | |
| 3 | 10667 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2502902 | |
| 1 | 1943046 | |
| 3 | 10667 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4456615 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2502902 | |
| 1 | 1943046 | |
| 3 | 10667 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4456615 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2502902 | |
| 1 | 1943046 | |
| 3 | 10667 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4456615 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2502902 | |
| 1 | 1943046 | |
| 3 | 10667 | 0.2% |
age
Real number (ℝ)
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.2002174 |
| Minimum | 1 |
|---|---|
| Maximum | 8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 8 |
| Range | 7 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.3636566 |
|---|---|
| Coefficient of variation (CV) | 0.42611372 |
| Kurtosis | -0.15562359 |
| Mean | 3.2002174 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.65531119 |
| Sum | 14262137 |
| Variance | 1.8595592 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 1412786 | |
| 3 | 1200894 | |
| 4 | 782056 | |
| 5 | 508997 | 11.4% |
| 1 | 243907 | 5.5% |
| 6 | 241727 | 5.4% |
| 7 | 63579 | 1.4% |
| 8 | 2669 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 243907 | 5.5% |
| 2 | 1412786 | |
| 3 | 1200894 | |
| 4 | 782056 | |
| 5 | 508997 | 11.4% |
| 6 | 241727 | 5.4% |
| 7 | 63579 | 1.4% |
| 8 | 2669 | 0.1% |
| Value | Count | Frequency (%) |
| 8 | 2669 | 0.1% |
| 7 | 63579 | 1.4% |
| 6 | 241727 | 5.4% |
| 5 | 508997 | 11.4% |
| 4 | 782056 | |
| 3 | 1200894 | |
| 2 | 1412786 | |
| 1 | 243907 | 5.5% |
income
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 3863 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 114.75254 |
| Minimum | 1 |
|---|---|
| Maximum | 405346 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 32 |
| Q1 | 55 |
| median | 84 |
| Q3 | 130 |
| 95-th percentile | 275 |
| Maximum | 405346 |
| Range | 405345 |
| Interquartile range (IQR) | 75 |
Descriptive statistics
| Standard deviation | 518.73307 |
|---|---|
| Coefficient of variation (CV) | 4.5204495 |
| Kurtosis | 343346.1 |
| Mean | 114.75254 |
| Median Absolute Deviation (MAD) | 34 |
| Skewness | 524.62539 |
| Sum | 5.1140789 × 108 |
| Variance | 269083.99 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 60 | 56566 | 1.3% |
| 50 | 52742 | 1.2% |
| 52 | 49624 | 1.1% |
| 65 | 48227 | 1.1% |
| 55 | 47466 | 1.1% |
| 62 | 46740 | 1.0% |
| 70 | 46169 | 1.0% |
| 48 | 45962 | 1.0% |
| 75 | 45433 | 1.0% |
| 42 | 45321 | 1.0% |
| Other values (3853) | 3972365 |
| Value | Count | Frequency (%) |
| 1 | 824 | |
| 2 | 1264 | |
| 3 | 1704 | |
| 4 | 1730 | |
| 5 | 1635 | |
| 6 | 1460 | |
| 7 | 1314 | |
| 8 | 1209 | |
| 9 | 1320 | |
| 10 | 1273 |
| Value | Count | Frequency (%) |
| 405346 | 1 | |
| 400000 | 1 | |
| 365001 | 1 | |
| 360000 | 1 | |
| 323648 | 1 | |
| 250000 | 1 | |
| 235210 | 1 | |
| 200000 | 1 | |
| 189000 | 1 | |
| 167923 | 1 |
loan_amount
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 733 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 291204.66 |
| Minimum | -1.292105 × 109 |
|---|---|
| Maximum | 1.106255 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 1 |
| Negative (%) | < 0.1% |
| Memory size | 68.0 MiB |
Quantile statistics
| Minimum | -1.292105 × 109 |
|---|---|
| 5-th percentile | 85000 |
| Q1 | 155000 |
| median | 235000 |
| Q3 | 345000 |
| 95-th percentile | 665000 |
| Maximum | 1.106255 × 109 |
| Range | 2.39836 × 109 |
| Interquartile range (IQR) | 190000 |
Descriptive statistics
| Standard deviation | 1066865.1 |
|---|---|
| Coefficient of variation (CV) | 3.6636267 |
| Kurtosis | 915691.82 |
| Mean | 291204.66 |
| Median Absolute Deviation (MAD) | 90000 |
| Skewness | 96.840122 |
| Sum | 1.297787 × 1012 |
| Variance | 1.1382012 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 205000 | 157354 | 3.5% |
| 155000 | 151759 | 3.4% |
| 165000 | 150651 | 3.4% |
| 175000 | 146826 | 3.3% |
| 185000 | 146274 | 3.3% |
| 225000 | 145148 | 3.3% |
| 195000 | 143153 | 3.2% |
| 145000 | 139658 | 3.1% |
| 215000 | 138049 | 3.1% |
| 125000 | 131331 | 2.9% |
| Other values (723) | 3006412 |
| Value | Count | Frequency (%) |
| -1292105000 | 1 | < 0.1% |
| 5000 | 2220 | < 0.1% |
| 15000 | 3311 | 0.1% |
| 25000 | 7301 | 0.2% |
| 35000 | 13513 | 0.3% |
| 45000 | 23214 | 0.5% |
| 55000 | 41856 | |
| 65000 | 52451 | |
| 75000 | 65259 | |
| 85000 | 75361 |
| Value | Count | Frequency (%) |
| 1106255000 | 1 | < 0.1% |
| 899805000 | 1 | < 0.1% |
| 660005000 | 1 | < 0.1% |
| 568005000 | 1 | < 0.1% |
| 410475000 | 1 | < 0.1% |
| 396005000 | 1 | < 0.1% |
| 46025000 | 2 | |
| 42735000 | 3 | |
| 35005000 | 1 | < 0.1% |
| 29005000 | 1 | < 0.1% |
property_value_ratio
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED 
| Distinct | 11833 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 977860 |
| Missing (%) | 21.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3997054 |
| Minimum | 0.008 |
|---|---|
| Maximum | 12967.896 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.0 MiB |
Quantile statistics
| Minimum | 0.008 |
|---|---|
| 5-th percentile | 0.556 |
| Q1 | 0.885 |
| median | 1.175 |
| Q3 | 1.614 |
| 95-th percentile | 2.852 |
| Maximum | 12967.896 |
| Range | 12967.888 |
| Interquartile range (IQR) | 0.729 |
Descriptive statistics
| Standard deviation | 7.3176278 |
|---|---|
| Coefficient of variation (CV) | 5.2279771 |
| Kurtosis | 2843300.5 |
| Mean | 1.3997054 |
| Median Absolute Deviation (MAD) | 0.34 |
| Skewness | 1624.4678 |
| Sum | 4869232.1 |
| Variance | 53.547676 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.134 | 6952 | 0.2% |
| 0.942 | 6870 | 0.2% |
| 1.057 | 6788 | 0.2% |
| 0.903 | 6130 | 0.1% |
| 1.018 | 5989 | 0.1% |
| 0.98 | 5860 | 0.1% |
| 0.994 | 5780 | 0.1% |
| 1.229 | 5717 | 0.1% |
| 1.318 | 5553 | 0.1% |
| 1.02 | 5492 | 0.1% |
| Other values (11823) | 3417624 | |
| (Missing) | 977860 | 21.9% |
| Value | Count | Frequency (%) |
| 0.008 | 1 | < 0.1% |
| 0.009 | 1 | < 0.1% |
| 0.01 | 1 | < 0.1% |
| 0.011 | 2 | < 0.1% |
| 0.013 | 3 | < 0.1% |
| 0.014 | 4 | < 0.1% |
| 0.016 | 2 | < 0.1% |
| 0.017 | 17 | |
| 0.018 | 1 | < 0.1% |
| 0.019 | 10 |
| Value | Count | Frequency (%) |
| 12967.896 | 1 | |
| 3010.999 | 1 | |
| 1832.974 | 1 | |
| 646.663 | 1 | |
| 486.491 | 1 | |
| 454.295 | 1 | |
| 418.116 | 1 | |
| 390.051 | 1 | |
| 336.373 | 1 | |
| 319.423 | 1 |
mortgage_term
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.0 MiB |
| 1 | |
|---|---|
| 2 | 266458 |
| 4 | 139851 |
| 3 | 35670 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4456615 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4 |
|---|---|
| 2nd row | 4 |
| 3rd row | 4 |
| 4th row | 4 |
| 5th row | 4 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 4014636 | |
| 2 | 266458 | 6.0% |
| 4 | 139851 | 3.1% |
| 3 | 35670 | 0.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 4014636 | |
| 2 | 266458 | 6.0% |
| 4 | 139851 | 3.1% |
| 3 | 35670 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 4014636 | |
| 2 | 266458 | 6.0% |
| 4 | 139851 | 3.1% |
| 3 | 35670 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4456615 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4014636 | |
| 2 | 266458 | 6.0% |
| 4 | 139851 | 3.1% |
| 3 | 35670 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4456615 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 4014636 | |
| 2 | 266458 | 6.0% |
| 4 | 139851 | 3.1% |
| 3 | 35670 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4456615 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 4014636 | |
| 2 | 266458 | 6.0% |
| 4 | 139851 | 3.1% |
| 3 | 35670 | 0.8% |
credit_model
Real number (ℝ)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.367178 |
| Minimum | 1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 3 |
| Q3 | 6 |
| 95-th percentile | 7 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.2914832 |
|---|---|
| Coefficient of variation (CV) | 0.68053522 |
| Kurtosis | -1.1140693 |
| Mean | 3.367178 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.67296203 |
| Sum | 15006216 |
| Variance | 5.2508953 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1145712 | |
| 7 | 1084625 | |
| 3 | 1083595 | |
| 2 | 913791 | |
| 5 | 174254 | 3.9% |
| 6 | 49970 | 1.1% |
| 4 | 4668 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1145712 | |
| 2 | 913791 | |
| 3 | 1083595 | |
| 4 | 4668 | 0.1% |
| 5 | 174254 | 3.9% |
| 6 | 49970 | 1.1% |
| 7 | 1084625 |
| Value | Count | Frequency (%) |
| 7 | 1084625 | |
| 6 | 49970 | 1.1% |
| 5 | 174254 | 3.9% |
| 4 | 4668 | 0.1% |
| 3 | 1083595 | |
| 2 | 913791 | |
| 1 | 1145712 |
debt_to_income_ratio
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.7698675 |
| Minimum | 1 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.7651503 |
|---|---|
| Coefficient of variation (CV) | 0.63726885 |
| Kurtosis | -0.74142445 |
| Mean | 2.7698675 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.74257001 |
| Sum | 12344233 |
| Variance | 3.1157554 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1441931 | |
| 2 | 937076 | |
| 3 | 864422 | |
| 6 | 728670 | |
| 4 | 359716 | 8.1% |
| 5 | 124800 | 2.8% |
| Value | Count | Frequency (%) |
| 1 | 1441931 | |
| 2 | 937076 | |
| 3 | 864422 | |
| 4 | 359716 | 8.1% |
| 5 | 124800 | 2.8% |
| 6 | 728670 |
| Value | Count | Frequency (%) |
| 6 | 728670 | |
| 5 | 124800 | 2.8% |
| 4 | 359716 | 8.1% |
| 3 | 864422 | |
| 2 | 937076 | |
| 1 | 1441931 |
combined_loan_to_value_ratio
Text
MISSING 
| Distinct | 96094 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 793452 |
| Missing (%) | 17.8% |
| Memory size | 68.0 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 4 |
| Mean length | 4.5862248 |
| Min length | 3 |
Characters and Unicode
| Total characters | 16800089 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 30951 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | Exempt |
|---|---|
| 2nd row | Exempt |
| 3rd row | Exempt |
| 4th row | Exempt |
| 5th row | Exempt |
| Value | Count | Frequency (%) |
| 80.0 | 607409 | |
| 96.5 | 495898 | |
| 95.0 | 482994 | 13.2% |
| 90.0 | 244033 | 6.7% |
| 97.0 | 237489 | 6.5% |
| exempt | 124872 | 3.4% |
| 85.0 | 81708 | 2.2% |
| 100.0 | 76814 | 2.1% |
| 75.0 | 63735 | 1.7% |
| 70.0 | 29910 | 0.8% |
| Other values (96084) | 1218301 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 3538291 | |
| 0 | 3388392 | |
| 9 | 2378346 | |
| 5 | 1582202 | |
| 8 | 1330971 | 7.9% |
| 6 | 1027400 | 6.1% |
| 7 | 887581 | 5.3% |
| 1 | 643855 | 3.8% |
| 4 | 499906 | 3.0% |
| 3 | 387643 | 2.3% |
| Other values (7) | 1135502 | 6.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 12512562 | |
| Other Punctuation | 3538291 | 21.1% |
| Lowercase Letter | 624360 | 3.7% |
| Uppercase Letter | 124876 | 0.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3388392 | |
| 9 | 2378346 | |
| 5 | 1582202 | |
| 8 | 1330971 | 10.6% |
| 6 | 1027400 | 8.2% |
| 7 | 887581 | 7.1% |
| 1 | 643855 | 5.1% |
| 4 | 499906 | 4.0% |
| 3 | 387643 | 3.1% |
| 2 | 386266 | 3.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 124872 | |
| p | 124872 | |
| t | 124872 | |
| e | 124872 | |
| x | 124872 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3538291 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 124876 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 16050853 | |
| Latin | 749236 | 4.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 3538291 | |
| 0 | 3388392 | |
| 9 | 2378346 | |
| 5 | 1582202 | |
| 8 | 1330971 | 8.3% |
| 6 | 1027400 | 6.4% |
| 7 | 887581 | 5.5% |
| 1 | 643855 | 4.0% |
| 4 | 499906 | 3.1% |
| 3 | 387643 | 2.4% |
Latin
| Value | Count | Frequency (%) |
| E | 124876 | |
| m | 124872 | |
| p | 124872 | |
| t | 124872 | |
| e | 124872 | |
| x | 124872 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16800089 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 3538291 | |
| 0 | 3388392 | |
| 9 | 2378346 | |
| 5 | 1582202 | |
| 8 | 1330971 | 7.9% |
| 6 | 1027400 | 6.1% |
| 7 | 887581 | 5.3% |
| 1 | 643855 | 3.8% |
| 4 | 499906 | 3.0% |
| 3 | 387643 | 2.3% |
| Other values (7) | 1135502 | 6.8% |
main_underwriter
Real number (ℝ)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.7321781 |
| Minimum | 1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 7 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.3011631 |
|---|---|
| Coefficient of variation (CV) | 0.84224491 |
| Kurtosis | -0.6692483 |
| Mean | 2.7321781 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.0001502 |
| Sum | 12176266 |
| Variance | 5.2953517 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 2259479 | |
| 7 | 714174 | 16.0% |
| 2 | 568733 | 12.8% |
| 3 | 530435 | 11.9% |
| 6 | 270456 | 6.1% |
| 5 | 112710 | 2.5% |
| 4 | 628 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 2259479 | |
| 2 | 568733 | 12.8% |
| 3 | 530435 | 11.9% |
| 4 | 628 | < 0.1% |
| 5 | 112710 | 2.5% |
| 6 | 270456 | 6.1% |
| 7 | 714174 | 16.0% |
| Value | Count | Frequency (%) |
| 7 | 714174 | 16.0% |
| 6 | 270456 | 6.1% |
| 5 | 112710 | 2.5% |
| 4 | 628 | < 0.1% |
| 3 | 530435 | 11.9% |
| 2 | 568733 | 12.8% |
| 1 | 2259479 |
tract_to_metro_income_percentage
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.0 MiB |
| 3 | |
|---|---|
| 4 | |
| 2 | |
| 5 | 172159 |
| 1 | 106629 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4456615 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 3 |
| 3rd row | 3 |
| 4th row | 3 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 1843814 | |
| 4 | 1699116 | |
| 2 | 634897 | 14.2% |
| 5 | 172159 | 3.9% |
| 1 | 106629 | 2.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 1843814 | |
| 4 | 1699116 | |
| 2 | 634897 | 14.2% |
| 5 | 172159 | 3.9% |
| 1 | 106629 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 1843814 | |
| 4 | 1699116 | |
| 2 | 634897 | 14.2% |
| 5 | 172159 | 3.9% |
| 1 | 106629 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4456615 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 1843814 | |
| 4 | 1699116 | |
| 2 | 634897 | 14.2% |
| 5 | 172159 | 3.9% |
| 1 | 106629 | 2.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4456615 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 1843814 | |
| 4 | 1699116 | |
| 2 | 634897 | 14.2% |
| 5 | 172159 | 3.9% |
| 1 | 106629 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4456615 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 1843814 | |
| 4 | 1699116 | |
| 2 | 634897 | 14.2% |
| 5 | 172159 | 3.9% |
| 1 | 106629 | 2.4% |
lender_type
Categorical
HIGH CORRELATION 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.0 MiB |
| 3 | |
|---|---|
| 1 | |
| 2 | |
| 4 | 12658 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4456615 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 2599609 | |
| 1 | 1438863 | |
| 2 | 405485 | 9.1% |
| 4 | 12658 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 2599609 | |
| 1 | 1438863 | |
| 2 | 405485 | 9.1% |
| 4 | 12658 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 2599609 | |
| 1 | 1438863 | |
| 2 | 405485 | 9.1% |
| 4 | 12658 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4456615 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 2599609 | |
| 1 | 1438863 | |
| 2 | 405485 | 9.1% |
| 4 | 12658 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4456615 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 2599609 | |
| 1 | 1438863 | |
| 2 | 405485 | 9.1% |
| 4 | 12658 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4456615 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 2599609 | |
| 1 | 1438863 | |
| 2 | 405485 | 9.1% |
| 4 | 12658 | 0.3% |
lender_size
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 1986 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 145933.16 |
| Minimum | 1 |
|---|---|
| Maximum | 1026755 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 700 |
| Q1 | 5732 |
| median | 25732 |
| Q3 | 153737 |
| 95-th percentile | 774905 |
| Maximum | 1026755 |
| Range | 1026754 |
| Interquartile range (IQR) | 148005 |
Descriptive statistics
| Standard deviation | 246597.39 |
|---|---|
| Coefficient of variation (CV) | 1.6897969 |
| Kurtosis | 4.3166477 |
| Mean | 145933.16 |
| Median Absolute Deviation (MAD) | 24474 |
| Skewness | 2.2255056 |
| Sum | 6.503679 × 1011 |
| Variance | 6.0810274 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 774905 | 167281 | 3.8% |
| 410835 | 165883 | 3.7% |
| 1026755 | 149098 | 3.3% |
| 198516 | 102803 | 2.3% |
| 466552 | 89358 | 2.0% |
| 527621 | 84340 | 1.9% |
| 282102 | 77327 | 1.7% |
| 257847 | 63864 | 1.4% |
| 130400 | 61108 | 1.4% |
| 119458 | 52230 | 1.2% |
| Other values (1976) | 3443323 |
| Value | Count | Frequency (%) |
| 1 | 4 | < 0.1% |
| 2 | 4 | < 0.1% |
| 4 | 7 | |
| 5 | 6 | < 0.1% |
| 6 | 3 | < 0.1% |
| 7 | 7 | |
| 8 | 16 | |
| 9 | 5 | < 0.1% |
| 10 | 13 | |
| 11 | 9 |
| Value | Count | Frequency (%) |
| 1026755 | 149098 | |
| 774905 | 167281 | |
| 527621 | 84340 | |
| 466552 | 89358 | |
| 410835 | 165883 | |
| 380650 | 48475 | 1.1% |
| 308884 | 24797 | 0.6% |
| 302784 | 10059 | 0.2% |
| 282102 | 77327 | |
| 257847 | 63864 | 1.4% |
white_population_pct
Real number (ℝ)
MISSING 
| Distinct | 70413 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 167194 |
| Missing (%) | 3.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 66.427039 |
| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 5128 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 12.976047 |
| Q1 | 51.464264 |
| median | 73.675337 |
| Q3 | 86.661654 |
| 95-th percentile | 95.529826 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 35.19739 |
Descriptive statistics
| Standard deviation | 25.342034 |
|---|---|
| Coefficient of variation (CV) | 0.38150178 |
| Kurtosis | -0.14374088 |
| Mean | 66.427039 |
| Median Absolute Deviation (MAD) | 15.538227 |
| Skewness | -0.89327985 |
| Sum | 2.8493353 × 108 |
| Variance | 642.21867 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 5128 | 0.1% |
| 25.08549218 | 2287 | 0.1% |
| 38.83451625 | 1920 | < 0.1% |
| 64.63691767 | 1769 | < 0.1% |
| 75.02294104 | 1574 | < 0.1% |
| 69.87282823 | 1573 | < 0.1% |
| 42.08020433 | 1520 | < 0.1% |
| 71.10172718 | 1483 | < 0.1% |
| 53.94154736 | 1479 | < 0.1% |
| 44.09115572 | 1297 | < 0.1% |
| Other values (70403) | 4269391 | |
| (Missing) | 167194 | 3.8% |
| Value | Count | Frequency (%) |
| 0 | 5128 | |
| 0.01163196464 | 18 | < 0.1% |
| 0.01282709082 | 5 | < 0.1% |
| 0.01891431814 | 94 | < 0.1% |
| 0.02134927412 | 2 | < 0.1% |
| 0.02161694769 | 2 | < 0.1% |
| 0.02297794118 | 18 | < 0.1% |
| 0.02919708029 | 4 | < 0.1% |
| 0.03082614057 | 26 | < 0.1% |
| 0.03355704698 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 1025 | |
| 99.96020692 | 10 | < 0.1% |
| 99.95645548 | 41 | < 0.1% |
| 99.93152248 | 24 | < 0.1% |
| 99.92146597 | 30 | < 0.1% |
| 99.92142483 | 6 | < 0.1% |
| 99.91421218 | 43 | < 0.1% |
| 99.90821478 | 16 | < 0.1% |
| 99.89059081 | 27 | < 0.1% |
| 99.88502443 | 53 | < 0.1% |
metro_name
Text
MISSING 
| Distinct | 959 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 295289 |
| Missing (%) | 6.6% |
| Memory size | 68.0 MiB |
Length
| Max length | 49 |
|---|---|
| Median length | 36 |
| Mean length | 25.053283 |
| Min length | 7 |
Characters and Unicode
| Total characters | 104254880 |
|---|---|
| Distinct characters | 62 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Ames, IA |
|---|---|
| 2nd row | Mason City, IA |
| 3rd row | Mason City, IA |
| 4th row | Ames, IA |
| 5th row | Albert Lea, MN |
| Value | Count | Frequency (%) |
| tx | 390772 | 3.6% |
| ca | 377632 | 3.4% |
| fl | 351267 | 3.2% |
| ga | 148022 | 1.4% |
| new | 140225 | 1.3% |
| city | 139081 | 1.3% |
| il | 138121 | 1.3% |
| pa | 133649 | 1.2% |
| az | 128427 | 1.2% |
| mi | 124043 | 1.1% |
| Other values (1079) | 8880307 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 7619290 | 7.3% |
| e | 6824047 | 6.5% |
| 6790220 | 6.5% | |
| n | 6615960 | 6.3% |
| o | 6190369 | 5.9% |
| - | 5758948 | 5.5% |
| r | 5052660 | 4.8% |
| l | 4584980 | 4.4% |
| i | 4481362 | 4.3% |
| t | 4251660 | 4.1% |
| Other values (52) | 46085384 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 65598830 | |
| Uppercase Letter | 21735593 | 20.8% |
| Space Separator | 6790220 | 6.5% |
| Dash Punctuation | 5758948 | 5.5% |
| Other Punctuation | 4371289 | 4.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 7619290 | |
| e | 6824047 | |
| n | 6615960 | |
| o | 6190369 | |
| r | 5052660 | 7.7% |
| l | 4584980 | 7.0% |
| i | 4481362 | 6.8% |
| t | 4251660 | 6.5% |
| s | 3600630 | 5.5% |
| d | 2291205 | 3.5% |
| Other values (20) | 14086667 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2419369 | 11.1% |
| A | 2349171 | 10.8% |
| N | 1591349 | 7.3% |
| S | 1491949 | 6.9% |
| L | 1389532 | 6.4% |
| M | 1264273 | 5.8% |
| T | 1012041 | 4.7% |
| P | 967350 | 4.5% |
| W | 948610 | 4.4% |
| B | 943174 | 4.3% |
| Other values (16) | 7358775 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4161326 | |
| . | 185008 | 4.2% |
| / | 21386 | 0.5% |
| ' | 3569 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 6790220 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5758948 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 87334423 | |
| Common | 16920457 | 16.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 7619290 | 8.7% |
| e | 6824047 | 7.8% |
| n | 6615960 | 7.6% |
| o | 6190369 | 7.1% |
| r | 5052660 | 5.8% |
| l | 4584980 | 5.2% |
| i | 4481362 | 5.1% |
| t | 4251660 | 4.9% |
| s | 3600630 | 4.1% |
| C | 2419369 | 2.8% |
| Other values (46) | 35694096 |
Common
| Value | Count | Frequency (%) |
| 6790220 | ||
| - | 5758948 | |
| , | 4161326 | |
| . | 185008 | 1.1% |
| / | 21386 | 0.1% |
| ' | 3569 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 104245168 | |
| None | 9712 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 7619290 | 7.3% |
| e | 6824047 | 6.5% |
| 6790220 | 6.5% | |
| n | 6615960 | 6.3% |
| o | 6190369 | 5.9% |
| - | 5758948 | 5.5% |
| r | 5052660 | 4.8% |
| l | 4584980 | 4.4% |
| i | 4481362 | 4.3% |
| t | 4251660 | 4.1% |
| Other values (48) | 46075672 |
None
| Value | Count | Frequency (%) |
| ó | 8608 | |
| ñ | 716 | 7.4% |
| á | 228 | 2.3% |
| ü | 160 | 1.6% |
metro_size_percentile
Real number (ℝ)
ZEROS 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.189303 |
| Minimum | 0 |
|---|---|
| Maximum | 111 |
| Zeros | 336394 |
| Zeros (%) | 7.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 7 |
| median | 9 |
| Q3 | 9 |
| 95-th percentile | 111 |
| Maximum | 111 |
| Range | 111 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 35.385262 |
|---|---|
| Coefficient of variation (CV) | 1.5946991 |
| Kurtosis | 1.5245638 |
| Mean | 22.189303 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.8568124 |
| Sum | 98889179 |
| Variance | 1252.1168 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 1547716 | |
| 8 | 709136 | |
| 99 | 466958 | 10.5% |
| 7 | 421596 | 9.5% |
| 0 | 336394 | 7.5% |
| 111 | 242745 | 5.4% |
| 6 | 235461 | 5.3% |
| 5 | 162238 | 3.6% |
| 4 | 115719 | 2.6% |
| 3 | 93262 | 2.1% |
| Other values (2) | 125390 | 2.8% |
| Value | Count | Frequency (%) |
| 0 | 336394 | 7.5% |
| 1 | 55460 | 1.2% |
| 2 | 69930 | 1.6% |
| 3 | 93262 | 2.1% |
| 4 | 115719 | 2.6% |
| 5 | 162238 | 3.6% |
| 6 | 235461 | 5.3% |
| 7 | 421596 | 9.5% |
| 8 | 709136 | |
| 9 | 1547716 |
| Value | Count | Frequency (%) |
| 111 | 242745 | 5.4% |
| 99 | 466958 | 10.5% |
| 9 | 1547716 | |
| 8 | 709136 | |
| 7 | 421596 | 9.5% |
| 6 | 235461 | 5.3% |
| 5 | 162238 | 3.6% |
| 4 | 115719 | 2.6% |
| 3 | 93262 | 2.1% |
| 2 | 69930 | 1.6% |
state_code
Real number (ℝ)
MISSING 
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 165896 |
| Missing (%) | 3.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.964869 |
| Minimum | 1 |
|---|---|
| Maximum | 72 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 12 |
| median | 27 |
| Q3 | 42 |
| 95-th percentile | 53 |
| Maximum | 72 |
| Range | 71 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 16.330678 |
|---|---|
| Coefficient of variation (CV) | 0.5839712 |
| Kurtosis | -1.2988164 |
| Mean | 27.964869 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | 0.054807516 |
| Sum | 1.1998939 × 108 |
| Variance | 266.69104 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 48 | 401535 | 9.0% |
| 6 | 380461 | 8.5% |
| 12 | 353025 | 7.9% |
| 17 | 164128 | 3.7% |
| 13 | 162564 | 3.6% |
| 39 | 160341 | 3.6% |
| 36 | 155299 | 3.5% |
| 37 | 152544 | 3.4% |
| 42 | 148457 | 3.3% |
| 26 | 132121 | 3.0% |
| Other values (42) | 2080244 | |
| (Missing) | 165896 | 3.7% |
| Value | Count | Frequency (%) |
| 1 | 57102 | 1.3% |
| 2 | 7281 | 0.2% |
| 4 | 128654 | 2.9% |
| 5 | 32172 | 0.7% |
| 6 | 380461 | |
| 8 | 110776 | 2.5% |
| 9 | 46330 | 1.0% |
| 10 | 13871 | 0.3% |
| 11 | 9377 | 0.2% |
| 12 | 353025 |
| Value | Count | Frequency (%) |
| 72 | 10673 | 0.2% |
| 56 | 6667 | 0.1% |
| 55 | 77930 | 1.7% |
| 54 | 14906 | 0.3% |
| 53 | 118035 | 2.6% |
| 51 | 113960 | 2.6% |
| 50 | 6476 | 0.1% |
| 49 | 62389 | 1.4% |
| 48 | 401535 | |
| 47 | 97799 | 2.2% |
county_code
Real number (ℝ)
MISSING 
| Distinct | 322 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 165896 |
| Missing (%) | 3.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 87.772382 |
| Minimum | 1 |
|---|---|
| Maximum | 840 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 68.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 29 |
| median | 65 |
| Q3 | 111 |
| 95-th percentile | 221 |
| Maximum | 840 |
| Range | 839 |
| Interquartile range (IQR) | 82 |
Descriptive statistics
| Standard deviation | 99.219352 |
|---|---|
| Coefficient of variation (CV) | 1.1304165 |
| Kurtosis | 14.040813 |
| Mean | 87.772382 |
| Median Absolute Deviation (MAD) | 40 |
| Skewness | 3.1935013 |
| Sum | 3.7660662 × 108 |
| Variance | 9844.4798 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 13 | 152295 | 3.4% |
| 31 | 140248 | 3.1% |
| 3 | 138386 | 3.1% |
| 37 | 109257 | 2.5% |
| 1 | 102351 | 2.3% |
| 59 | 87640 | 2.0% |
| 5 | 85793 | 1.9% |
| 29 | 78275 | 1.8% |
| 35 | 76930 | 1.7% |
| 11 | 75116 | 1.7% |
| Other values (312) | 3244428 | |
| (Missing) | 165896 | 3.7% |
| Value | Count | Frequency (%) |
| 1 | 102351 | |
| 3 | 138386 | |
| 5 | 85793 | |
| 6 | 54 | < 0.1% |
| 7 | 43918 | 1.0% |
| 9 | 54394 | 1.2% |
| 11 | 75116 | |
| 12 | 63 | < 0.1% |
| 13 | 152295 | |
| 14 | 1653 | < 0.1% |
| Value | Count | Frequency (%) |
| 840 | 282 | < 0.1% |
| 830 | 138 | < 0.1% |
| 820 | 442 | < 0.1% |
| 810 | 5366 | |
| 800 | 1196 | < 0.1% |
| 790 | 387 | < 0.1% |
| 775 | 324 | < 0.1% |
| 770 | 1310 | < 0.1% |
| 760 | 2943 | |
| 750 | 79 | < 0.1% |
census_tract
Text
MISSING 
| Distinct | 71951 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 167065 |
| Missing (%) | 3.7% |
| Memory size | 68.0 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.999996 |
| Min length | 2 |
Characters and Unicode
| Total characters | 47185032 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 783 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 19081270100 |
|---|---|
| 2nd row | 19081270200 |
| 3rd row | 19169010600 |
| 4th row | 19081270100 |
| 5th row | 19081270100 |
| Value | Count | Frequency (%) |
| 48157672900 | 2287 | 0.1% |
| 48201542900 | 1920 | < 0.1% |
| 48157673200 | 1769 | < 0.1% |
| 48085030305 | 1574 | < 0.1% |
| 48085030203 | 1573 | < 0.1% |
| 48157673101 | 1520 | < 0.1% |
| 48439114103 | 1483 | < 0.1% |
| 48157673400 | 1479 | < 0.1% |
| 48201543002 | 1297 | < 0.1% |
| 49035113107 | 1291 | < 0.1% |
| Other values (71941) | 4273357 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 15876296 | |
| 1 | 7493303 | |
| 3 | 4337380 | 9.2% |
| 2 | 4260146 | 9.0% |
| 5 | 3144274 | 6.7% |
| 4 | 3086982 | 6.5% |
| 7 | 2519965 | 5.3% |
| 9 | 2420755 | 5.1% |
| 6 | 2190757 | 4.6% |
| 8 | 1855170 | 3.9% |
| Other values (2) | 4 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 47185028 | |
| Uppercase Letter | 2 | < 0.1% |
| Lowercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 15876296 | |
| 1 | 7493303 | |
| 3 | 4337380 | 9.2% |
| 2 | 4260146 | 9.0% |
| 5 | 3144274 | 6.7% |
| 4 | 3086982 | 6.5% |
| 7 | 2519965 | 5.3% |
| 9 | 2420755 | 5.1% |
| 6 | 2190757 | 4.6% |
| 8 | 1855170 | 3.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 47185028 | |
| Latin | 4 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 15876296 | |
| 1 | 7493303 | |
| 3 | 4337380 | 9.2% |
| 2 | 4260146 | 9.0% |
| 5 | 3144274 | 6.7% |
| 4 | 3086982 | 6.5% |
| 7 | 2519965 | 5.3% |
| 9 | 2420755 | 5.1% |
| 6 | 2190757 | 4.6% |
| 8 | 1855170 | 3.9% |
Latin
| Value | Count | Frequency (%) |
| N | 2 | |
| a | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 47185032 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 15876296 | |
| 1 | 7493303 | |
| 3 | 4337380 | 9.2% |
| 2 | 4260146 | 9.0% |
| 5 | 3144274 | 6.7% |
| 4 | 3086982 | 6.5% |
| 7 | 2519965 | 5.3% |
| 9 | 2420755 | 5.1% |
| 6 | 2190757 | 4.6% |
| 8 | 1855170 | 3.9% |
| Other values (2) | 4 | < 0.1% |
activity_year
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.0 MiB |
| 2019 |
|---|
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 17826460 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2019 |
|---|---|
| 2nd row | 2019 |
| 3rd row | 2019 |
| 4th row | 2019 |
| 5th row | 2019 |
Common Values
| Value | Count | Frequency (%) |
| 2019 | 4456615 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2019 | 4456615 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 4456615 | |
| 0 | 4456615 | |
| 1 | 4456615 | |
| 9 | 4456615 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 17826460 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 4456615 | |
| 0 | 4456615 | |
| 1 | 4456615 | |
| 9 | 4456615 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17826460 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 4456615 | |
| 0 | 4456615 | |
| 1 | 4456615 | |
| 9 | 4456615 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17826460 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 4456615 | |
| 0 | 4456615 | |
| 1 | 4456615 | |
| 9 | 4456615 |
loan_outcome
Categorical
HIGH CORRELATION 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.0 MiB |
| 1 | |
|---|---|
| 4 | |
| 3 | 318170 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4456615 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 3215722 | |
| 4 | 922723 | 20.7% |
| 3 | 318170 | 7.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 3215722 | |
| 4 | 922723 | 20.7% |
| 3 | 318170 | 7.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 3215722 | |
| 4 | 922723 | 20.7% |
| 3 | 318170 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4456615 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3215722 | |
| 4 | 922723 | 20.7% |
| 3 | 318170 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4456615 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 3215722 | |
| 4 | 922723 | 20.7% |
| 3 | 318170 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4456615 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 3215722 | |
| 4 | 922723 | 20.7% |
| 3 | 318170 | 7.1% |
lender_id
Text
| Distinct | 5144 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 68.0 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Characters and Unicode
| Total characters | 89132300 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 91 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 25490003YGASV5ENH153 |
|---|---|
| 2nd row | 25490003YGASV5ENH153 |
| 3rd row | 25490003YGASV5ENH153 |
| 4th row | 25490003YGASV5ENH153 |
| 5th row | 25490003YGASV5ENH153 |
| Value | Count | Frequency (%) |
| 549300fgxn1k3hlb1r50 | 167281 | 3.8% |
| 549300hw662mn1wu8550 | 165883 | 3.7% |
| kb1h1dsprfmymcufxt09 | 149098 | 3.3% |
| 549300mgpzblqdil7538 | 102803 | 2.3% |
| b4tydeb6gkmzo031mb27 | 89358 | 2.0% |
| 7h6glxdrugqfu57rne97 | 84340 | 1.9% |
| 549300j7xkt2bi5wx213 | 77327 | 1.7% |
| 549300ag64nhilb7zp05 | 63864 | 1.4% |
| 549300u3721pjgqzyy68 | 61108 | 1.4% |
| 549300aq3t62gxdu7d76 | 52230 | 1.2% |
| Other values (5134) | 3443323 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 9786828 | 11.0% |
| 5 | 6314133 | 7.1% |
| 3 | 5826707 | 6.5% |
| 4 | 5696358 | 6.4% |
| 9 | 5046116 | 5.7% |
| 1 | 3231566 | 3.6% |
| 2 | 2809968 | 3.2% |
| 7 | 2644075 | 3.0% |
| 6 | 2573103 | 2.9% |
| 8 | 2147321 | 2.4% |
| Other values (26) | 43056125 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 46076175 | |
| Uppercase Letter | 43056125 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 2069572 | 4.8% |
| H | 2026636 | 4.7% |
| D | 2009257 | 4.7% |
| M | 2002468 | 4.7% |
| N | 1970014 | 4.6% |
| R | 1942877 | 4.5% |
| S | 1826433 | 4.2% |
| W | 1817562 | 4.2% |
| G | 1810619 | 4.2% |
| L | 1797071 | 4.2% |
| Other values (16) | 23783616 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 9786828 | |
| 5 | 6314133 | |
| 3 | 5826707 | |
| 4 | 5696358 | |
| 9 | 5046116 | |
| 1 | 3231566 | 7.0% |
| 2 | 2809968 | 6.1% |
| 7 | 2644075 | 5.7% |
| 6 | 2573103 | 5.6% |
| 8 | 2147321 | 4.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 46076175 | |
| Latin | 43056125 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 2069572 | 4.8% |
| H | 2026636 | 4.7% |
| D | 2009257 | 4.7% |
| M | 2002468 | 4.7% |
| N | 1970014 | 4.6% |
| R | 1942877 | 4.5% |
| S | 1826433 | 4.2% |
| W | 1817562 | 4.2% |
| G | 1810619 | 4.2% |
| L | 1797071 | 4.2% |
| Other values (16) | 23783616 |
Common
| Value | Count | Frequency (%) |
| 0 | 9786828 | |
| 5 | 6314133 | |
| 3 | 5826707 | |
| 4 | 5696358 | |
| 9 | 5046116 | |
| 1 | 3231566 | 7.0% |
| 2 | 2809968 | 6.1% |
| 7 | 2644075 | 5.7% |
| 6 | 2573103 | 5.6% |
| 8 | 2147321 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 89132300 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 9786828 | 11.0% |
| 5 | 6314133 | 7.1% |
| 3 | 5826707 | 6.5% |
| 4 | 5696358 | 6.4% |
| 9 | 5046116 | 5.7% |
| 1 | 3231566 | 3.6% |
| 2 | 2809968 | 3.2% |
| 7 | 2644075 | 3.0% |
| 6 | 2573103 | 2.9% |
| 8 | 2147321 | 2.4% |
| Other values (26) | 43056125 |
| age | co_applicant | county_code | credit_model | debt_to_income_ratio | income | lender_size | lender_type | loan_amount | loan_outcome | main_underwriter | metro_size_percentile | mortgage_term | property_value_ratio | race | sex | state_code | tract_to_metro_income_percentage | white_population_pct | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| age | 1.000 | 0.047 | -0.014 | 0.013 | 0.026 | 0.107 | 0.016 | 0.036 | 0.039 | 0.033 | 0.019 | -0.006 | 0.073 | 0.145 | -0.012 | 0.019 | -0.047 | 0.060 | 0.039 |
| co_applicant | 0.047 | 1.000 | 0.022 | -0.086 | 0.054 | -0.365 | 0.012 | 0.042 | -0.221 | 0.030 | -0.002 | -0.003 | 0.039 | -0.232 | -0.025 | 0.096 | -0.005 | 0.079 | -0.082 |
| county_code | -0.014 | 0.022 | 1.000 | -0.005 | -0.009 | -0.056 | -0.038 | 0.103 | -0.144 | 0.038 | 0.004 | 0.025 | 0.068 | 0.062 | -0.005 | 0.033 | 0.213 | 0.125 | -0.017 |
| credit_model | 0.013 | -0.086 | -0.005 | 1.000 | 0.442 | 0.025 | -0.110 | 0.085 | 0.005 | 0.463 | 0.172 | -0.004 | 0.176 | 0.007 | 0.002 | 0.021 | -0.000 | 0.036 | 0.002 |
| debt_to_income_ratio | 0.026 | 0.054 | -0.009 | 0.442 | 1.000 | -0.179 | -0.043 | 0.145 | 0.012 | 0.603 | 0.168 | 0.011 | 0.547 | -0.098 | 0.021 | 0.041 | -0.029 | 0.059 | -0.089 |
| income | 0.107 | -0.365 | -0.056 | 0.025 | -0.179 | 1.000 | 0.077 | 0.000 | 0.712 | 0.002 | 0.032 | 0.146 | 0.000 | 0.563 | -0.026 | 0.005 | -0.038 | 0.002 | 0.044 |
| lender_size | 0.016 | 0.012 | -0.038 | -0.110 | -0.043 | 0.077 | 1.000 | 0.946 | 0.139 | 0.230 | 0.015 | 0.123 | 0.538 | 0.015 | 0.026 | 0.161 | -0.089 | 0.289 | -0.104 |
| lender_type | 0.036 | 0.042 | 0.103 | 0.085 | 0.145 | 0.000 | 0.946 | 1.000 | 0.007 | 0.077 | -0.223 | 0.069 | 0.144 | -0.120 | 0.057 | 0.028 | -0.041 | 0.142 | -0.141 |
| loan_amount | 0.039 | -0.221 | -0.144 | 0.005 | 0.012 | 0.712 | 0.139 | 0.007 | 1.000 | 0.001 | 0.018 | 0.200 | 0.001 | 0.579 | -0.016 | 0.000 | -0.113 | 0.000 | -0.079 |
| loan_outcome | 0.033 | 0.030 | 0.038 | 0.463 | 0.603 | 0.002 | 0.230 | 0.077 | 0.001 | 1.000 | 0.167 | -0.077 | 0.024 | -0.044 | 0.007 | 0.029 | -0.011 | 0.227 | -0.054 |
| main_underwriter | 0.019 | -0.002 | 0.004 | 0.172 | 0.168 | 0.032 | 0.015 | -0.223 | 0.018 | 0.167 | 1.000 | -0.006 | 0.244 | 0.031 | -0.012 | 0.032 | -0.007 | 0.098 | 0.006 |
| metro_size_percentile | -0.006 | -0.003 | 0.025 | -0.004 | 0.011 | 0.146 | 0.123 | 0.069 | 0.200 | -0.077 | -0.006 | 1.000 | 0.085 | -0.006 | -0.002 | 0.018 | -0.039 | 0.373 | -0.229 |
| mortgage_term | 0.073 | 0.039 | 0.068 | 0.176 | 0.547 | 0.000 | 0.538 | 0.144 | 0.001 | 0.024 | 0.244 | 0.085 | 1.000 | 0.090 | -0.023 | 0.028 | 0.052 | 0.041 | 0.123 |
| property_value_ratio | 0.145 | -0.232 | 0.062 | 0.007 | -0.098 | 0.563 | 0.015 | -0.120 | 0.579 | -0.044 | 0.031 | -0.006 | 0.090 | 1.000 | -0.030 | 0.000 | 0.038 | 0.000 | 0.201 |
| race | -0.012 | -0.025 | -0.005 | 0.002 | 0.021 | -0.026 | 0.026 | 0.057 | -0.016 | 0.007 | -0.012 | -0.002 | -0.023 | -0.030 | 1.000 | 0.389 | -0.016 | 0.101 | -0.022 |
| sex | 0.019 | 0.096 | 0.033 | 0.021 | 0.041 | 0.005 | 0.161 | 0.028 | 0.000 | 0.029 | 0.032 | 0.018 | 0.028 | 0.000 | 0.389 | 1.000 | -0.006 | 0.039 | -0.047 |
| state_code | -0.047 | -0.005 | 0.213 | -0.000 | -0.029 | -0.038 | -0.089 | -0.041 | -0.113 | -0.011 | -0.007 | -0.039 | 0.052 | 0.038 | -0.016 | -0.006 | 1.000 | 0.088 | 0.142 |
| tract_to_metro_income_percentage | 0.060 | 0.079 | 0.125 | 0.036 | 0.059 | 0.002 | 0.289 | 0.142 | 0.000 | 0.227 | 0.098 | 0.373 | 0.041 | 0.000 | 0.101 | 0.039 | 0.088 | 1.000 | 0.265 |
| white_population_pct | 0.039 | -0.082 | -0.017 | 0.002 | -0.089 | 0.044 | -0.104 | -0.141 | -0.079 | -0.054 | 0.006 | -0.229 | 0.123 | 0.201 | -0.022 | -0.047 | 0.142 | 0.265 | 1.000 |
| race | sex | co_applicant | age | income | loan_amount | property_value_ratio | mortgage_term | credit_model | debt_to_income_ratio | combined_loan_to_value_ratio | main_underwriter | tract_to_metro_income_percentage | lender_type | lender_size | white_population_pct | metro_name | metro_size_percentile | state_code | county_code | census_tract | activity_year | loan_outcome | lender_id | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 222 | 5 | 2 | 2 | 2 | 23.0 | 25000 | NaN | 4 | 7 | 5 | Exempt | 7 | 3 | 1 | 107 | 94.487578 | NaN | 000 | 19 | 081 | 19081270100 | 2019 | 1 | 25490003YGASV5ENH153 |
| 230 | 5 | 1 | 2 | 2 | 42.0 | 85000 | NaN | 4 | 7 | 5 | Exempt | 7 | 3 | 1 | 107 | 93.845535 | NaN | 000 | 19 | 081 | 19081270200 | 2019 | 1 | 25490003YGASV5ENH153 |
| 234 | 5 | 1 | 1 | 2 | 125.0 | 95000 | NaN | 4 | 7 | 5 | Exempt | 7 | 3 | 1 | 107 | 96.340348 | Ames, IA | 1 | 19 | 169 | 19169010600 | 2019 | 1 | 25490003YGASV5ENH153 |
| 245 | 5 | 1 | 2 | 1 | 34.0 | 75000 | NaN | 4 | 7 | 5 | Exempt | 7 | 3 | 1 | 107 | 94.487578 | NaN | 000 | 19 | 081 | 19081270100 | 2019 | 1 | 25490003YGASV5ENH153 |
| 260 | 5 | 2 | 2 | 2 | 37.0 | 145000 | NaN | 4 | 7 | 5 | Exempt | 7 | 3 | 1 | 107 | 94.487578 | NaN | 000 | 19 | 081 | 19081270100 | 2019 | 1 | 25490003YGASV5ENH153 |
| 261 | 5 | 2 | 1 | 4 | 57.0 | 75000 | NaN | 4 | 7 | 5 | Exempt | 7 | 3 | 1 | 107 | 92.326835 | NaN | 000 | 19 | 081 | 19081270300 | 2019 | 1 | 25490003YGASV5ENH153 |
| 269 | 5 | 1 | 2 | 1 | 27.0 | 65000 | NaN | 4 | 7 | 5 | Exempt | 7 | 3 | 1 | 107 | 93.845535 | NaN | 000 | 19 | 081 | 19081270200 | 2019 | 1 | 25490003YGASV5ENH153 |
| 276 | 5 | 1 | 2 | 1 | 32.0 | 75000 | NaN | 4 | 7 | 5 | Exempt | 7 | 3 | 1 | 107 | 94.487578 | NaN | 000 | 19 | 081 | 19081270100 | 2019 | 1 | 25490003YGASV5ENH153 |
| 281 | 5 | 1 | 2 | 2 | 35.0 | 25000 | NaN | 4 | 7 | 5 | Exempt | 7 | 3 | 1 | 107 | 93.845535 | NaN | 000 | 19 | 081 | 19081270200 | 2019 | 1 | 25490003YGASV5ENH153 |
| 289 | 5 | 1 | 1 | 3 | 123.0 | 175000 | NaN | 4 | 7 | 5 | Exempt | 7 | 3 | 1 | 107 | 92.326835 | NaN | 000 | 19 | 081 | 19081270300 | 2019 | 1 | 25490003YGASV5ENH153 |
| race | sex | co_applicant | age | income | loan_amount | property_value_ratio | mortgage_term | credit_model | debt_to_income_ratio | combined_loan_to_value_ratio | main_underwriter | tract_to_metro_income_percentage | lender_type | lender_size | white_population_pct | metro_name | metro_size_percentile | state_code | county_code | census_tract | activity_year | loan_outcome | lender_id | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 17545373 | 2 | 2 | 1 | 4 | 133.0 | 655000 | NaN | 1 | 7 | 6 | NaN | 7 | 4 | 3 | 1894 | 78.854426 | Cambridge-Newton-Framingham, MA | 9 | 25 | 017 | 25017367100 | 2019 | 4 | 549300L0OVX5O63S8C68 |
| 17545376 | 7 | 3 | 2 | 5 | 2628.0 | 1985000 | NaN | 1 | 7 | 6 | NaN | 7 | 4 | 3 | 1894 | 82.484383 | Boston, MA | 9 | 25 | 025 | 25025060600 | 2019 | 4 | 549300L0OVX5O63S8C68 |
| 17545383 | 7 | 1 | 2 | 4 | 947.0 | 1495000 | 7.596 | 1 | 7 | 1 | 46.123 | 7 | 4 | 3 | 1894 | 79.474940 | Bridgeport-Stamford-Norwalk, CT | 8 | 09 | 001 | 09001011100 | 2019 | 1 | 549300L0OVX5O63S8C68 |
| 17545386 | 5 | 2 | 1 | 5 | 196.0 | 375000 | 0.929 | 1 | 7 | 1 | 80.0 | 1 | 3 | 3 | 1894 | 80.891304 | Cambridge-Newton-Framingham, MA | 9 | 25 | 017 | 25017317102 | 2019 | 1 | 549300L0OVX5O63S8C68 |
| 17545388 | 7 | 3 | 1 | 5 | 68.0 | 315000 | 1.618 | 1 | 3 | 3 | 65.235 | 7 | 4 | 3 | 1894 | 89.135066 | Providence-Warwick, RI-MA | 8 | 25 | 005 | 25005631700 | 2019 | 1 | 549300L0OVX5O63S8C68 |
| 17545390 | 5 | 2 | 1 | 5 | 196.0 | 395000 | NaN | 1 | 7 | 6 | NaN | 1 | 3 | 3 | 1894 | 67.825622 | Cambridge-Newton-Framingham, MA | 9 | 25 | 017 | 25017332300 | 2019 | 4 | 549300L0OVX5O63S8C68 |
| 17545392 | 5 | 1 | 1 | 3 | 365.0 | 865000 | 2.283 | 1 | 7 | 3 | 80.0 | 7 | 4 | 3 | 1894 | 93.231994 | Boston, MA | 9 | 25 | 021 | 25021409102 | 2019 | 1 | 549300L0OVX5O63S8C68 |
| 17545394 | 7 | 3 | 2 | 4 | 25.0 | 85000 | 0.339 | 1 | 3 | 3 | 90.0 | 2 | 1 | 3 | 1894 | 48.053528 | Worcester, MA-CT | 8 | 25 | 027 | 25027710700 | 2019 | 1 | 549300L0OVX5O63S8C68 |
| 17545395 | 5 | 2 | 1 | 2 | 318.0 | 685000 | 3.047 | 1 | 3 | 1 | 95.0 | 7 | 4 | 3 | 1894 | 92.470277 | Worcester, MA-CT | 8 | 25 | 027 | 25027715100 | 2019 | 1 | 549300L0OVX5O63S8C68 |
| 17545397 | 7 | 2 | 1 | 2 | 315.0 | 1645000 | NaN | 1 | 7 | 6 | NaN | 7 | 4 | 3 | 1894 | 64.553795 | Boston, MA | 9 | 25 | 025 | 25025070800 | 2019 | 4 | 549300L0OVX5O63S8C68 |
Most frequently occurring
| race | sex | co_applicant | age | income | loan_amount | property_value_ratio | mortgage_term | credit_model | debt_to_income_ratio | combined_loan_to_value_ratio | main_underwriter | tract_to_metro_income_percentage | lender_type | lender_size | white_population_pct | metro_name | metro_size_percentile | state_code | county_code | census_tract | activity_year | loan_outcome | lender_id | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3226 | 7 | 3 | 2 | 3 | 102.0 | 305000 | NaN | 1 | 7 | 6 | NaN | 7 | 1 | 3 | 99472 | 15.104446 | Atlanta-Sandy Springs-Alpharetta, GA | 99 | 13 | 121 | 13121011800 | 2019 | 4 | 549300VORTI31GZTJL53 | 37 |
| 3227 | 7 | 3 | 2 | 3 | 102.0 | 305000 | NaN | 4 | 7 | 6 | NaN | 7 | 1 | 3 | 99472 | 15.104446 | Atlanta-Sandy Springs-Alpharetta, GA | 99 | 13 | 121 | 13121011800 | 2019 | 4 | 549300VORTI31GZTJL53 | 26 |
| 3225 | 7 | 3 | 2 | 3 | 102.0 | 305000 | NaN | 1 | 7 | 6 | NaN | 1 | 1 | 3 | 99472 | 15.104446 | Atlanta-Sandy Springs-Alpharetta, GA | 99 | 13 | 121 | 13121011800 | 2019 | 4 | 549300VORTI31GZTJL53 | 22 |
| 2175 | 5 | 2 | 2 | 3 | 102.0 | 305000 | NaN | 1 | 7 | 6 | NaN | 7 | 1 | 3 | 99472 | 15.104446 | Atlanta-Sandy Springs-Alpharetta, GA | 99 | 13 | 121 | 13121011800 | 2019 | 4 | 549300VORTI31GZTJL53 | 9 |
| 544 | 3 | 2 | 2 | 3 | 90.0 | 355000 | NaN | 1 | 7 | 6 | NaN | 7 | 1 | 3 | 282102 | 5.413345 | Oakland-Berkeley-Livermore, CA | 9 | 06 | 001 | 06001408800 | 2019 | 4 | 549300J7XKT2BI5WX213 | 4 |
| 1525 | 5 | 1 | 2 | 3 | 600.0 | 1525000 | NaN | 2 | 7 | 6 | NaN | 7 | 4 | 1 | 25968 | 88.379205 | Dallas-Plano-Irving, TX | 9 | 48 | 113 | 48113013004 | 2019 | 4 | FU7RSW4CQQY98A2O7J66 | 4 |
| 2993 | 7 | 1 | 2 | 4 | 139.0 | 415000 | NaN | 1 | 7 | 6 | NaN | 7 | 4 | 3 | 14083 | 84.144976 | Elgin, IL | 7 | 17 | 089 | 17089852101 | 2019 | 4 | 5493009DTDMV4MI5MT96 | 4 |
| 3201 | 7 | 3 | 2 | 3 | 38.0 | 95000 | NaN | 1 | 7 | 6 | NaN | 7 | 2 | 3 | 99472 | 30.083397 | Baton Rouge, LA | 8 | 22 | 005 | 22005031000 | 2019 | 4 | 549300VORTI31GZTJL53 | 4 |
| 3224 | 7 | 3 | 2 | 3 | 96.0 | 325000 | NaN | 1 | 7 | 6 | NaN | 7 | 4 | 3 | 99472 | 70.451405 | Phoenix-Mesa-Chandler, AZ | 9 | 04 | 013 | 04013422646 | 2019 | 4 | 549300VORTI31GZTJL53 | 4 |
| 3228 | 7 | 3 | 2 | 3 | 102.0 | 305000 | NaN | 4 | 7 | 6 | NaN | 7 | 3 | 3 | 99472 | 35.797665 | Fayetteville, NC | 6 | 37 | 093 | 37093970101 | 2019 | 4 | 549300VORTI31GZTJL53 | 4 |